533 Rec'd PCT^TO 13 AUG 2001 



SUBSTITUTE FORM PTO-1390 



U S DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE 



TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEY'S DOCKET NUMBER 
08106-005001 



U.S. APPLICATION NO. (If Known, see 37 CFR 



1.5) 



09/9134U 



INTERNATIONAL APPLICATION NO. 
PCT/CA00/00173 



INTERNATIONAL FILING DATE 

21 February 2000 



PRIORITY DATE CLAIMED 

19 February 1999 



TITLE OF INVENTION 

PRODUCTION OF HETEROLOGOUS POLYPEPTIDES FROM FRESHWATER CAULOBACTER 



APPLICANT^) FOR DO/EO/US 
John Smit 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1. S This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 . 

3. □ This is an express request to promptly begin national examination procedures (35 U.S.C. 371(f)). 

4. □ The US has been elected by the expiration of 19 months from the priority date (PCT Article 31). 

5. IEI A copy of the International Application as filed (35 U.S.C. 371 (c)(2)) 

a. |EI is attached hereto (required only if not communicated by the International Bureau). 

b. □ has been communicated by the International Bureau. 

q c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 

ft] An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 

23 Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3)) 

L t a. □ are attached hereto (required only if not communicated by the International Bureau). 

% b, □ have been communicated by the International Bureau. 

^7 c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

^ d. have not been made and will not be made. 

8. p An English language translation of amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 

9. VQ An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 

10. H An English language translation of the annexes to the International Preliminary Examination Report under 
|W PCT Article 36 (35 U.S.C. 371(c)(5)). 

Itenis 1 1 to 16 below concern other documents or information included: 



6. 
7. 



11 



An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 



12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is 

included. 

13. [2 A FIRST preliminary amendment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. □ A substitute specification. 

1 5. □ A change of power of attorney and/or address letter. 

16. Other items or information: 



□ 
□ 
□ 



International Search Report with cited references. 
International Preliminary Examination Report. 



CERTIFICATE OF MAILING BY EXPRESS MAIL 



Express Mail Label n £L Mgfr.Sl M^faQ U 3 



I hereby certify under 37 CFR §110 that this correspondence is being deposited with the United States Postal Service as Express Mail 
Post Office to Addressee with sufficient postage on the date indicated below and is addressed to the Commissioner for Patents, 
Washington, D.C. 20231 



Date dtfDeposit / 




Signature 




mMlrj M i 



Typed Name of 
Person Signing 



msmo^mmo 73 aug2oqi 



U.S. APPLICATION NO. Xli^Xrj|pWW) i «| I I INTERNATIONAL APPLICATION NO. 

f»Q/ Hi P41*T I PCT/CA00/00173 

17. The following fees are submitted: 

Basic National Fee ( 37 CFR 1.492(a)(1)- (5) ): 

Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO $1000 

International preliminary examination fee (37 CFR 1482) not paid to 

USPTO but International Search Report prepared by the EPO or JPO $860 

International preliminary examination fee (37 CFR 1.482) not paid to USPTO but 
international search fee (37 CFR 1.445(a)(2)) paid to USPTO $710 

International preliminary examination fee paid to USPTO (37 CFR 1.482) 

but all claims did not satisfy provisions of PCT Article 33(1 )-(4) $690 

International preliminary examination fee paid to USPTO (37 CFR 1.482) 

and all claims satisfied provisions of PCT Article 33(1 )-(4) $100 



ATTORNEY'S DOCKET NUMBER 

08106-005001 



CALCULATIONS PTO USE 

ONLY 



ENTER APPROPRIATE BASIC FEE AMOUNT = 


$860.00 




Surcharge of $130 for furnishing the oath or declaration later than □ 20 □ 30 
months from the earliest claimed priority date (37 CFR 1.492(e)). 


$0.00 




O Claims 


Number Filed 


Number Extra 


Rate 






TotaMllaims 


20 - 20 = 




x$18 


$0.00 




Indejiendent Claims 


4- 3 = 


1 


x$80 


$80.00 




MUUTJPLE DEPENDENT CLAIMS(S) (if applicable) 


+ $270 


$0.00 








TOTAL OF ABOVE CALCULATIONS = 


$940.00 




U /Replicant claims small entity status. See 37 CFR 1.27. The fees indicated above are 
reduced by 1/2. 


$0.00 










SUBTOTAL = 


$940.00 




Processing fee of $130 for furnishing the English Translation later than □ 20 □ 30 
moifhs from the earliest claimed priority date (37 CFR 1.492(f)) 


$0.00 








TOTAL NATIONAL FEE = 


$940.00 




Fee for recording the enclosed assignment (37 CFR 1 .21 (h)). The assignment must be 
acctiUpanied by an appropriate cover sheet (37 CFR 3.28, 3.31 ).$40.00 per property + 


$0.00 








TOTAL FEES ENCLOSED = 


$940.00 












Amount to be 
refunded: 


$ 










Charged: 


$ 



lEl A check in the amount of $940.00 to cover the above fees is enclosed. 

□ Please charge my Deposit Account No. 06-1050 in the amount of $0.00 to cover the above fees. A duplicate 

copy of this sheet is enclosed. 
IEl The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Account No. 06-1 050. A duplicate copy of this sheet is enclosed. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive 
(37 CFR 1.137(a) or (b) must be filed and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: 



Y. Rocky Tsao 

FISH & RICHARDSON P.C. 

225 Franklin Street 

Boston, Massachusetts 02110-2804 

(617) 542-5070 phone 

(617) 542-8906 facsimile 



»4r 



SIGNATURE : ff 






Y. Rockv Tsao 


NAME 






34.053 


REGISTRATION NUMBER 



20301163.doc 




IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant 
Serial No. 



John Smit 

N/A 



Art Unit : Unknown 
Examiner : Unknown 



Filed : Herewith 

Title : PRODUCTION OF HETEROLOGOUS POLYPEPTIDES FROM 
FRESHWATER CAULOBACTER 

Commissioner for Patents 
Washington, D.C. 20231 



Prior to examination, please amend the application as follows: 
In the claims : 

Amend claims 4,10-12, 15 as follows: 

- 4. (Amended) The cell of claim 1 wherein the DNA construct further comprises 
an operably linked promoter recognized by the cell. 

-10. (Amended) The DNA construct of claim 9 further comprising an operably linked 
promoter recognized by Caulobacter . 

-11. (Amended) The DNA construct of claim 8 wherein the secretion signal has an 
amino acid sequence which does not comprise SEQ ID NO:l. 

--12. (Amended) A bacterial cell comprising a DNA construct of claim 9. 
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-15. (Amended) The cell of claim 13 wherein the DNA construct further comprises an 
operably linked promoter recognized by Caulobacter w herein the DNA construct is 
expressed in the cell and the protein so expressed is secreted by the cell 

Add claims 17-20. 

- 17. The cell of claim 2 wherein the DNA construct further comprises an operably 
linked promoter recognized by the cell. 

18. The cell of claim 3 wherein the DNA construct further comprises an operably 
linked promoter recognized by the cell. 

-19. The DNA construct of claim 9 wherein the secretion signal has an amino acid 
sequence which does not comprise SEQ ID NO:l. 

-20. The DNA construct of claim 10 wherein the secretion signal has an amino acid 
sequence which does not comprise SEQ ID NO:l. 
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Version with markings to show changes made 



In the claims: 

Claim 4,10-12, 15 has been amended as follows: 

4. (Amended) The cell of claim 1 [,2 or 3] wherein the DNA construct further 
comprises an operably linked promoter recognized by the cell. 

-10. (Amended) The DNA construct of claim 9 [or 10] further comprising an operably 
linked promoter recognized by Caulobacter , 

—11. (Amended) The DNA construct of claim 8 [,9 or 10,] wherein the secretion signal 
has an amino acid sequence which does not comprise SEQ ID NO: 1 . 

-12. (Amended) A bacterial cell comprising a DNA construct of claim 9 [,10, or 1 1]. 

-15. (Amended) The cell of claim 13 [or 14,] wherein the DNA construct further 
comprises an operably linked promoter recognized by Caulobacter w herein the DNA 
construct is expressed in the cell and the protein so expressed is secreted by the cell. 
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PRODUCTIQNOF HETERQLDGQUS 
POLYPEPTO 

Field of Invention 

5 

This invention relates to the use of the Caulobacter surface layer protein 
(S-iayer protein) transport system for the expression and secretion of heterologous 
polypeptides from a host organism. 

10 Background of the invention 

Many genera of bacteria assemble layers composed of repetitive, regularly 
aligned, proteinaceous sub-units on the outer surface of the cell. These layers are 
essentially two-dimensional paracrystalline arrays, and being the outer molecular layer 
15 of the organism, directly interface with the environment. Such layers are commonly 
known as S-layers and are found on members of every taxonomic group of walled 
bacteria including: Archaebacteria ; Chlamydia ; Cyanobacteria ; Acinetobacter ; 
Bacillus ; Aquaspirillum ; Caulobacter ; Clostridium ; Chromarium . Typically, an 
S-layer will be composed of an intricate, geometric array of at least one major protein 
2 0 having a repetitive regular structure. In many cases, such as in Caulobacter , the 
S-layer protein is synthesized by the cell in large quantities and the S-layer completely 
envelopes the cell and thus appears to be a protective layer. 



Caulobacter are natural inhabitants of most soil and freshwater environments 

2 5 and may persist in waste water treatment systems and effluents. The bacteria alternate 

between a stalked cell that is attached to a surface, and an adhesive motile dispersal 
cell that searches to find a new surface upon which to stick and convert to a stalked 
cell. The bacteria attach tenaciously to nearly all surfaces and do so without 
producing the extracellular enzymes or polysaccharide "slimes" that are characteristic 

3 0 of most other surface attached bacteria. They have simple requirements for growth. 

The organism is ubiquitous in the environment and has been isolated from 
oligotrophy to mesotrophic situations. Caulobacters are known for their ability to 
tolerate low nutrient level stresses, for example, low phosphate levels. This nutrient 
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can be limiting in many leachate waste streams, especially those with high levels of 
iron or calcium. 

Freshwater Caulobacter producing S-layers may be readily detected by 
5 negative stain transmission electron microscopy techniques, Caulobacter may be 
isolated using the methods outlined by MacRae, J.D. and Smit (1991) Applied and 
Environmental Microbiology 57:751-758, which take advantage of the fact that 
Caulobacter can tolerate periods of starvation while other soil and water bacteria may 
not and that they all produce a distinctive stalk structure, visible by light microscopy 
10 (using either phase contrast or standard dye staining methods). Once Caulobacter 
strains are isolated in a typical procedure, colonies are suspended in 2% ammonium 
molybdate negative stain and applied to plastic-filmed, carbon-stabilized 300 or 
400 mesh copper or nickel grids and examined in a transmission electron microscope 
at 60 kilovolt accelerating voltage, as described in Smit, J. ^(1986) "Protein Surface 
15 Layers of Bacteria", in Outer Membranes as Model System , M. Inouge, 
Ed. J. Wiley & Sons, at page 343-376. S-layers are seen a two-dimensional 
geometric patterns most readily on those cells in a colony that have lysed and released 
their internal contents. 

2 0 The S-layer of different freshwater Caulobacter is hexagonally arranged with a 

similar centre-centre dimension and antisera raised against the S-layer protein of 
C. crescentus strain CB15 reacts with S-layer proteins from other Caulobacter 
(see: Walker, S.G., eral. (1992) (J. Bacteriol. 174:1783-1792). All S-layer proteins 
isolated from Caulobacter may be substantially purified using the same extraction 

2 5 method (pH extraction). All strains appear to have a lipopolysaccharide (LPS) 

reactive with antisera against the CB15 strain lipopolysaccharide species. The LPS 
appears to be required for S-layer attachment. 

The S-layer elaborated by freshwater isolates of Caulobacter are visibly 

3 0 indistinguishable from the S-layer produced by Caulobacter crescentus strains CB2 

and CB15. The S-layer proteins from the latter strains have approximately 
100,000 m.w. although sizes of S-layer proteins from other species and strains will 
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vary. The protein has been characterized both structurally and chemically. It is 
composed of ring-like structures spaced at 22nm intervals arranged in a hexagonal 
manner on the outer membrane. The S-layer is bound to the bacterial surface and may- 
be removed by low pH treatment or by treatment with a calcium chelator such as 
5 EDTA. 



The similarity of S-layer proteins in different strains of Caulobacter permits 
the use of a cloned S-layer protein gene of one Caulobacter strain for retrieval of the 
corresponding gene in other Caulobacter strains (see: Walker, S.G. et al . (1992) 
10 [ supra ]; and, MacRae, J.D. et_al. (1991) [ supra ] . 



Expression, secretion and optionally, presentation of a heterologous 
polypeptide in Caulobacter provides advantages not previously seen in systems using 
organisms such as E. coli and Salmonella in which fusion products using different 

1 5 surface proteins have been reported. All known Caulobacter strains are believed to be 
harmless and are nearly ubiquitous in aquatic environments. In contrast, many 
Salmonella and E. coli strains are pathogens. Consequently, expression and secretion 
of a heterologous polypeptide using Caulobacter as a vehicle will have the advantage 
that the expression system will be stable in a variety of outdoor environments and may 

2 0 not present problems associated with the use of a pathogenic organism. Furthermore. 
Caulobacter are natural biofilm forming species and may be adapted for use in fixed 
biofilm bioreactors. The quantity of S-layer protein that is synthesized and is secreted 
by Caulobacter is high, reaching 12% of the cell protein. The unique characteristics 
of the repetitive, two-dimensional S-layer would also make such bacteria ideal for use 

2 5 as an expression system, or as a presentation surface for heterologous polypeptides. 

This is desirable in a live vaccine to maximize presentation of the antigen or antigenic 
epitope. In addition, use of such a presentation surface to achieve maximal exposure 
of a desired polypeptide to the environment results in such bacteria being particularly 
suited for use in bioreactors or as carriers for the polypeptide in aqueous or terrestrial 

3 0 outdoor environments. 



The invention described in the PCT application published September 18. 1997 
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under WO 97/34000 describes the C-tenninal region of Caulobacter crescentus 
S-layer protein as being essential for secretion of S-layer protein in that species. 
Heterologous polypeptides may be conveniently expressed and secreted by a host 
Caulobacter when the polypeptide is expressed as a fusion with the C-terminal 
5 secretion signal. Further studies with C. crescentus have demonstrated that the 
species employs a type I secretion system which involves an uncleaved C-terminal 
secretion signal on the surface layer protein (RsaA) and several transport proteins 
encoded by genes 3' to the surface layer protein gene (rsaA) (Amram, P. and Srnit, J, 
(1998) Journal of Bacteriology 180:3062-3069). 

:o 

A typical type I secretion system uses three transport protein components. 
One such component, the ABC transporter, is embedded in the inner membrane, 
contains an ATP-binding region, recognizes the C-tenninal secretion signal of the 
substrate protein, and hydrolyzes ATP during the transport process. Another 
15 component, the membrane fusion protein (MFP) is anchored in the inner membrane 
and appears to span the periplasm. The remaining component is an outer membrane 
protein (OMP) that is thought to interact with the MFP to form a channel that extends 
from the cytoplasm through the two membranes to the outside of the cell. In 
C. crescentus , the ABC transporter and the MFP proteins have been termed RsaD and 
2 0 RsaE (respectively) and their genes are immediately 3' of rsaA. Further downstream 
is the rsaF gene which is believed to encode the OMP. 

It is desirable to provide for the use of Caulobacter species other than 
C. crescentus in the expression and secretion of heterologous polypeptides from a host 

2 5 organism. 

Summary of Invention 

This invention is based on the discovery that S-layer producing freshwater 

3 0 Caulobacter (other than C . crescentus ) rely on a type I secretion signal located at the 

C-terminus of the S-layer protein and highly conserved transport proteins. While the 
secretion signal itself is not as well conserved as the transport proteins, a secretion 
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signal from a first species of Caulobacter will be recognized by the transport 
mechanism of other species. Thus, a surface layer protein secretion signal derived 
from any freshwater S-layer producing Caulobacter may be used in the invention 
described in WO 97/34000. As well, any Caulobacter which contains a type I 
5 secretion system may be used as a host organism for the expression and secretion of 
heterologous polypeptides to which a Caulobacter S-laver protein secretion signal has 
been fused. Nucleic acid constructs made for expression of heterologous polypeptides 
may include a surface layer protein secretion signal from a Caulobacter other than 
C. crescentus, for expression in the same species from which the surface layer protein 
10 signal was derived or for expression in a different species. Furthermore, a C-terrninal 
secretion signal derived from the S-layer protein (RsaA) of C. crescentus . may be 
used in such transformation of Caulobacter other than C. crescentus . 

This invention also provides the use of Caulobacter other than C. crescentus as 
15 a host organism for the expression of polypeptides heterologous to a surface layer 
protein of the Caulobacter , wherein the Caulobacter has at least one surface layer 
transport protein that is homologous to RsaD or Rsa£ of C. crescentus . This 
invention also provides a method for identifying a candidate Caulobacter for such use, 
comprising extracting DNA from the Caulobacter , contacting the DNA with an 
2 0 oligonucleotide that is selectively hybridizable to one of rsaD and rsaE of 
C. crescentus , and detennining whether the oligonucleotide hybridizes to the DNA. 
The sequences of RsaD and RsaE and coding sequences rsaD and rsaE are known. 

This invention also provides a Caulobacter host, wherein the host comprises at 

2 5 least one surface layer transport protein having an amino acid sequence homologous 

to RsaD or RsaE. and wherein the host further comprises a DNA construct for 
expression of a polypeptide heterologous to a surface layer protein of the host, the 
construct comprising DNA encoding a heterologous polypeptide 5' to and operably 
linked with DNA encoding a Caulobacter surface layer protein secretion signal, with 

3 0 the proviso that when the host comprises transport proteins having sequences the same 

as both RsaD and RsaE, the secretion signal is not from C. crescentus. 
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This invention also provides a DNA construct comprising one or more 
restriction sites for facilitating insertion of DNA into the construct, wherein the 
construct further comprises DNA encoding a Caulobacter surface layer protein 
secretion signal not present in C. crescentus . 

5 

This invention also provides a DNA construct for expression of a heterologous 
polypeptide comprising DNA encoding a polypeptide not present in Caulobacter 
surface layer protein 5 ! from and operatively linked to DNA encoding a surface layer 
protein secretion signal not present in C. crescentus . 

10 

A surface layer protein secretion signal not present in C crescentus will 
function as such a signal in a Caulobacter type I secretion system but will not have an 
amino acid sequence that is the same as amino acids 945-1026 of the Rsa protein of 
C. crescentus . The latter sequence (SEQ ID NO:l) is: 

15 

AfGAAVTLGAAATI^QYLDAAAAGDGSG 
ATFVSGADAVTKLTGLVTLTTSAFATEVLTLA 

This invention also provides a bacterial cell comprising the aforementioned 
2 0 DNA constructs. Where the bacterial cell is other than C crescentus , the DNA 
construct may comprise a surface layer protein secretion signal derived from RsaA. 
This invention also provides the use of the aforementioned DNA constructs for 
transformation of bacterial cells and the use of such cells for expression and secretion 
of polypeptides heterologous to the cell. Where the cell is Caulobacter . the 
2 5 polypeptide is heterologous to the S-layer protein of the cell. This invention also 
provides proteins comprising heterologous material, secreted from a Caulobacter in 
which the secretion signal is not found in C. crescentus . 
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Description of the Drawings 

For better understanding of this invention, reference may be made to the 
preferred embodiments and examples described below, and the accompanying 
5 drawing in which; 

Figure 1 shows the organization of the C. crescentus genome with respect to 
the surface layer protein subunit gene (rsaA) and the downstream (3') type I transport 
protein genes: rsaD (encodes the ABC - transporter), rsaE (encodes the membrane 
10 fusion protein (MPF) and rsaF (encodes the outer membrane protein OMP). LPS 
genes A-F are involved in the production of lipopoly saccharides. 

Description of the Preferred Embodiments 

15 Organisms for use in this invention include all S-layer producing freshwater 

species or strains of Caulobacter , While similarity of the S-layer gene and S-layer 
secretion systems permits the use of different S-layer protein producing freshwater 
Caulobacter in this invention, the C-terminal secretion signals of the S-layer genes of 
C. crescentus strains CB2 and CB15 (and variants of those strains which contain 

2 0 homo logs of the rsaA gene encoding the 1026 amino acid paracrystalline S-layer 
protein described in: Gilchrist, A. et aL 1992 Can. J. Microbiol. 38:193-208) are 
specifically referred to in the detailed description and Examples set out below. 

Caulobacter strains that are incapable of forming an S-layer, including those 

2 5 which shed the S-layer protein upon secretion, may be used in this invention. 

Examples are the S-layer negative mutants CB2A and CB15AKSac described in 
Smit, J., and N. Agabian (1984) J. Bacteriol. 160:1137-1145; and, Edwards, P., and 
J, Smit (1991) J. Bacteriol. 173:5568-5572. Examples of shedding strains are 
CB15Ca5 and CB15Cal0 described in Edwards and Smit (1991), and the smooth 

3 0 lipopolysaccharide deficient mutants described in Walker, S.G. et al . (1994) - 

J. Bacteriol. 176:6312-6323. 
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A heterologous polypeptide as referred to herein may be any peptide, 
polypeptide, protein or a pan of a protein which is desired to be expressed in 
Caulobacter and which may be secreted by the bacterium. A polypeptide that is 
heterologous to a surface layer protein of a Caulobacter means a polypeptide not 
5 found in the surface layer protein native to the Caulobacter in which the heterologous 
polypeptide is expressed. 

Heterologous polypeptides include enzymes and other functional sequences of 
amino acids as well as ligands, antigens, antigenic epitopes and haptens. The size of 
10 the heterologous polypeptide will be selected depending upon whether an intact 
S-layer is to be produced in the Caulobacter or whether the protein to be recovered 
from the bacterial medium as described below. Heterologous polypeptides of about 
400 amino acids have been expressed. Preferably, the cysteine content of the 
heterologous polypeptide and the capacity for formation of ^isulphide bonds within 
IS the chimeric protein will be kept to a minimum to minimize disruption of the secretion 
of the chimeric protein. However, the presence of cysteine residues capable of 
forming a disulphide bond which are relatively close together, may not affect 
secretion. 

2 0 This invention may be practised by implementing known methods for insertion 

of a selected heterologous coding sequence into all or part of an S-layer protein gene 
so that both the S-Iayer protein and the heterologous sequence are operably linked 
thereby permitting the S-layer protein and the heterologous sequence to be transcribed 
together and "in-frame". Sequencing of an S-layer protein gene permits one to 

2 5 identify potential sites to install heterologous genetic material. The repetitive nature 

of the protein in the S-layer permits multiple copies of heterologous polypeptides to be 
expressed. 

The following general procedure lays out courses of action, with reference to 

3 0 particular plasmid vectors or constructions, that may be used to accomplish fusion of 

an S-Layer protein with a polypeptide of interest. The following description makes 
reference to the rsaA gene of C. crescentus described by Gilchrist, A. et al . (1992) 
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and in WO 97/34000, as an example. 

The general procedure includes detailed steps allowing for the following 
possibilities: 

5 

(1) use of a collection of potentially permissive sites in the S-layer gene to 
install the genetic information for a polypeptide of interest; 

(2) use of a carrier cassette for delivering a gene of interest to sites within 
1C the S-layer gene; 

(3) creation of a collection of random insertion sites based on a restriction 
enzyme of choice, if the available collection of potentially permissive sites is for some 
reason unsuitable; and, 

15 

(4) direct insertion of DNA coding for a polypeptide of interest into 
permissive sites. 

The general procedure involves the following steps and alternative courses of 
2 0 action. As a first step the practitioner may choose an appropriate region (or specific 
amino acid position) of the S-layer for insertion of a desired polypeptide. Second, the 
practitioner will create a unique restriction site (preferably hexameric) in the S-layer 
gene at a position within the gene encoding that region (or corresponding to a specific 
amino acid) using either standard linker mutagenesis (regional) or site directed 

2 5 mutagenesis (specific amino acid). The unique restriction site will act as a site for 

accepting DNA encoding the polypeptide of interest. For example, the plasmid-based 
promoter-less version of the rsaA gene (pTZ18U:rsaA P) described in Gilchrist, A. 
et_al. (1992) may be used because it contains an appropriate combination of 5' and 3' 
restriction sites useful for subsequent steps. Preferably, the restriction site should not 

3 0 occur in the S-layer gene, its carrier plasmid or the DNA sequence coding for the 

polypeptide of interest. 
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If it is unclear which region of the S-layer would be suitable for insertion of a 
polypeptide of interest, a random linker mutagenesis approach may be used to 
randomly insert a unique linker-encoded restriction site (preferably hexameric) at 
various positions in the gene. Sites for insertion of the linker are created using an 
5 endonuclease, either of a sequence specific nature (e.g. tetrameric recognition site 
restriction enzyme) or sequence non-specific nature (e.g. Deoxyribonuclease I 
[DNase I]). A particularly suitable method is the generalized selectable linker 
mutagenesis approach described in Bingle, W.H., and J. Smit. (1991) Biotechniques 
10: 150-152, by which endonuclease digestion is carried out under partial digestion 
10 conditions and a library of linker insertions at different positions in a gene is created. 
Partial digestion with different endonucleases create potential sites for insertion of a 
linker. 

If restriction endonucleases are used to create sites for subsequent insertion cf 
15 a linker encoding a hexameric restriction site, mutagenesis may also be done with a 
mixture of 3 different linkers incorporating appropriate spacer nucleotides in order to 
obtain an insertion with proper reading frame at a particular restriction site. With 
DNase I, only one linker is needed, but only 1 of 3 linker insertions may be useful for 
accepting DNA encoding the polypeptide of interest depending on the position of the 
2 0 DNase I cleavage. 

A linker tagged with a marker may be used to insert DNA of interest at a 
restriction site. For example, if BamHI sites are appropriate as sites for the 
introduction of DNA encoding a polypeptide of interest, BamHI linkers tagged with a 

2 5 kanamycin-resistance gene for selectable linker mutagenesis may be used. One such 

12-bp linker carried in plasmid pUC1021K for use in rsaA was described by Binsls 
and Smit (1991). Two additional 15-bp linkers (pUC7165K and pTZ6571K) for 
creating 2 other possible translation frames within the linker insert itself are described 
in Figures 3 and 4 of WO 97/34000. A mixture of three such linkers is preferably 

3 0 used for mutagenesis. 

Once a library composed of linker insertions encoding desired a hexameric 
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restriction site at different positions has been created, DNA encoding a polypeptide of 
interest may be inserted into the sites en masse . The library may be digested with the 
restriction enzyme specific for the newly-introduced linker encoded restriction site and 
iigated to a DNA fragment encoding the polypeptide of interest and carrying the 
5 appropriate complementary cohesive termini. DNA specifying the polypeptide of 
interest can be prepared by a number of standard methods, which may include 
oligonucleotide synthesis of 2 anti-complementary strands, polymerase chain reaction 
(PCR) procedures, or addition of linkers whose termini are compatible with the 
introduced sites in the target gene to a suitably modified segment of DNA. 

10 

In order to facilitate the rapid recovery of genes carrying DNA inserted at 
restriction sites, a carrier oligonucleotide may be used. An example of the use of 
such a carrier, shown in Figure 1 of WO 97/34000, was designed to accept DNA 
prepared by PCR or by anneating synthesized oligonucleotides and controls direction 

15 of insertion of the foreign segment into a rsaA gene through use of a promoterless 
drug resistance marker. The DNA of interest is first directionally cloned, if possible, 
using the Xho l, StuI, or Sail sites or non-directionally cloned using any one of the 
sites in the same orientation as a promoterless chloramphenicol resistance (CmR) 
gene. To do this the DNA of interest may be provided with the appropriate termini 

2 0 for cloning and spacer nucleotides for maintaining correct reading frame within the 
cassette and should not contain a Belli site. For insertion into the BamHI linker 
library, the DNA of interest is recovered as a BamH I fragment tagged with a CmR 
gene. When Iigated to the Bam HI digested rsaA linker library, only those colonies of 
the bacterium (eg. E. coli ) used for the gene modification steps that are recovered will 

2 5 be those carrying insertions of the desired DNA in the correct orientation, since the 

promoter on the plasmid is 5' to rsaA P and the CmR gene. This eliminates screening 
for DNA introduction and increases the recovery of useful clones. While still 
manipulating the library as one unit, the CmR gene is removed using Bglll. The 
carrier oligonucleotide also provides the opponunity to add DNA 5' or 3' to the DNA 

3 0 of interest at Sail, Xho l or StuI sites providing the DNA of interest does not contain 

any of these sites. This allows some control over spacing between rsaA sequences 
and the sequence of the DNA of interest. 
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Genes carrying the DNA of interest in the correct orientation may be excised 
from the plasmid and transferred to a suitable vector providing a promoter recognized 
by Caulobacter . Such vectors include pWB9 or pWBlO with EcoRI/SstI sites, as 
5 described in Bingle, W.H., and J. Smit. (1990), Plasmid 24: 143-148, The DNA of 
interest should not contain the same restriction sites present in the vector. This allows 
expression of the hybrids in S-layer negative mutants of Caulobacter . 

Caulobacter surviving transfer are examined for chimeric protein secretion, 
10 and optionally S-layer assembly or presentation of the new polypeptide activity, 
antigenicity, etc. on the cell, by methods specific to the needs of the investigator or 
the capabilities of the inserted sequence. Many of the sites created are "benign" as 
they have no effect on the functional regions of the protein involved with export, self 
assembly, etc. However, not every site that results in ^n absence of functional 
1 5 disruption of the S-layer is best for insertion of new activities. Some sites may not be 
well exposed on the surface of the organism and other sites may not tolerate insertion 
of much more DNA than the linker sequence. 

It is possible to express single or multiple insertions of heterologous 
20 polypeptides in a S-layer chimeric protein which will still assemble as an S-layer on 
the cell surface. Some sites may be sensitive to even small insertions resulting in the 
chimeric protein being released into the medium. Release may also be deliberately 
effected by use of a shedding strain of Caulobacter to express the chimeric protein or 
by physical removal of the S-layer from whole cells. Where S-layer assembly is not 

2 5 required, quite large polypeptides may be expressed as part of the S-layer protein. 

Expressing a chimeric protein containing a S-layer protein component having 
substantial deletions, may increase the size of the heterologous polypeptides that will 
be expressed and secreted by Caulobacter . 

3 0 The preceding methods describe insertion of linkers in-frame into a 

promoterless version of the S-layer gene. The sites that are introduced allow 
subsequent insertion of foreign DNA in-frame into the full length gene. This 
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invention also involves the construction of chimeric S-layer protein genes and the 
resulting production of chimeric S-Iayer proteins in which the S-layer gene component 
lacks large portions of the gene. This reduces the amount of Caulobacter protein 
present in the secreted chimeric protein. Generally, large deletions throughout the 
5 S-layer gene will result in a chimeric protein that is not capable of forming an S-layer. 
Attachment of the S-layer to the cell is abolished if the N-terrninal amino acids which 
contribute to S-layer formation are deleted. For example, deletion of the first 
29 N-terrninal amino acids of the RsaA protein will accomplish this. Also absence of 
the first 776 amino acids from the N-terminal region of RsaA will result in a chimeric 

1 0 protein secreted from the cell but having a S-layer component consisting of only the 
250 C-termina! amino acids of RsaA. Since only the extreme C-terminal region 
corresponding to approximately amino acids 945-1026 of RsaA is required for 
secretion of an S-layer chimeric protein from C. crescentus . use of only the 
C terminal secretion signal will prevent S-layer formation. Furthermore, use of only 

15 the C-terminal region promotes spontaneous aggregation of much of the secreted 
chimeric protein in the cell medium and formation of a macroscopic precipitate that 
may be collected with a course mesh or sheared to micron-sized particles. Yields of 
up to 250 mg. (dry weight) of protein per liter of cells may be possible. 

20 Sequence analysis of the 3* region of the S-layer genes from different strains 

of C. crescentus shows that the portion of the gene encoding the C-terminal region of 
the S-layer protein is highly conserved within the species. It has now been 
determined that while there is moderate variability in the sequence of surface layer 
proteins (including the secretion signal) from different species of freshwater 

2 5 Caulobacter, there is an unusually high sequence conservation among different 

Caulobacter species for the ABC-transporter protein and the membrane fusion protein. 
Sequence analysis of CB15 and CB2A (readily distinguishable strains of 
C. crescentus) shows identical DNA sequences coding for the last 118 amino acids of 
the RsaA protein (which includes the secretion signal) and sequencing of the next 

3 0 downstream translated gene (rsaD) to amino acid 97 of the gene product shows only a 

single base pair change, resulting in a conservative amino acid substitution in the 
ABC transporter protein. Sequence analysis of surface layer protein genes and the 
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transporter protein genes in species other than C, crescentus shows more variability in 
the secretion signals between species as compared to strains of C. crescentus . 
However, a much higher level of inter-species conservation exists with respect to the 
transporter proteins, even (as is the case with some species of Caulobacter ) when the 
5 transporters are not located immediately downstream from the surface layer protein 
gene. 

It having now been demonstrated that species of Caulobacter other than 
C. crescentus employ a C-terminal secretion signal for the surface layer protein and 
10 contain highly conserved transport proteins, the procedures described herein or are 
known in the art. may be readily employed for use surface layer protein secretion 
signals from Caulobacter other than C. crescentus and to identify and use species 
other than C crescentus as a host for expression of heterologous polypeptides. 

r 

15 The moderate inter-species conservation of surface layer protein genes 

(particularly for glycine-aspartic acid rich regions of the protein) may be exploited for 
locating a S-layer protein gene in a candidate Caulobacter , using known methods. 
Alternatively, the gene may be located by searching for a sequence hybridizable to a 
sequence derived from an amino acid sequence which is determined by sequencing the 

2 0 S-layer protein secreted by the candidate cell, using methods in the known art. 

The minimal amino acid tract from a Caulobacter that constitutes the essential 
surface layer protein secretion signal may be determined by the procedures described 
herein or by methods known in the art. One approach is to identify regions from 
2 5 S-layer genes of a Caulobacter which code for amino acid sequences that exhibit some 
identity to the last 82 C-terminal residues of the RsaA protein of C. crescentus . 
Homology to upstream sequences in the protein may also be assessed. Another 
approach is to delete N-terminal amino acids from the surface layer protein until 
secretion is lost. 

30 

Caulobacter other than C. crescentus may be screened for suitability as hosts 
for expression and secretion of heterologous polypeptides by determining whether a 
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candidate cell has a gene or gene product which exhibits sufficient identity to the rsaD 
or rsaE genes or RsaD or RsaE proteins from C. crescentus . This may be 
conveniently accomplished by determining whether a oligonucleotide probe based on 
the rsaD or rsaE gene sequences will selectively hybridize to DNA from the candidate 
5 cell. The probe is prepared by any means for construction of an oligonucleotide and 
will preferably have a sequence that is homologous to all or part of rsaD or rsaE. 
The probe will consist of at least 20, more preferably at least 30, more preferably at 
least 40, and even more preferably at least 50 nucleotides. The probe may be used 
for amplification by known procedures (eg. by PCR) of target DNA or may be 

10 labelled for direct determination of the presence of target DNA by known procedures. 
Labels include radio-labels, fluorescent labels, etc. Detection of target DNA may be 
accomplished through various standard techniques such as Southern blotting, in-siru 
hybridization, etc. Caulobacter other than C. crescentus are useful as host organisms 
for expression and secretion of heterologous polypeptides when the host contains a 

15 transport protein that is homologous to either the RsaD or RsaE proteins of 
C. crescentus. 



An amino acid or nucleic acid sequence is "homologous" to another such 
sequence if the two sequences are substantially identical and the function of the 
2 0 sequences is conserved (for example, both sequences function as or encode a secretion 
signal or a transport protein functional in Caulobacter ). Two amino acid or nucleic 
acid sequences are considered substantially identical if they share at least about 70% 
sequence identity, preferably at least about 80% sequence identity, more preferably at 
least about 90% sequence identity. Sequence identity may be determined using the 

2 5 BLAST algorithm, described in Altschul et al. (1990), J. Mol. Biol. 215:403-10 

(using the published default settings). In such circumstances, percentage of sequence 
identity may be expressed as a "homology" of the same percentage. 

An alternative indication that two nucleic acid sequences are homologous 

3 0 (substantially identical) is when two sequences selectively hybridize to each other 

under at least moderately stringent conditions. Hybridization to filter-bound 
sequences under moderately stringent conditions may, for example, be performed in 
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0.5 M NaHP0 4 , 7% sodium dodecyi sulfate (SDS), 1 mM EDTA at 65°C, and 
washing in 0.2 x SSC/0.1% SDS at 42°C (see Ausubel et al. (eds), 1989, Current 
Protocols in Molecular Biology , Vol. 1, Green Publishing Associates, Inc., and John 
Wiley & Sons, Inc., New York, at page 2,10.3). Higher sequence identity is 
5 demonstrated by hybridization to filter-bound sequences under stringent conditions 
which may (for example) be performed in 0.5 M NaHP0 4 , 7% SDS, 1 mM EDTA at 
65*C, and washing in 0.1 x SSC/0.1% SDS at 68°C (see Ausubel, ctal. (eds), 1989). 
Hybridization conditions may be modified in accordance with known methods 
depending on the sequence of interest (see Tijssen, 1993, Laboratory Techniques in 
10 Biochemistry and Molecular Biology - Hybridization with Nucleic Acid Probes , Part 
I, Chapter 2 "Overview of Principles of Hybridization and the Strategy of Nucleic 
Acid Probe Assays", Elsevier, New York). Generally, stringent conditions are 
selected to be about 5°C lower than the thermal melting point for the specific 
sequence at a defined ionic strength and pH. 

15 

In this invention, screening of Caulobacter for use as a host organism 
according to transport protein sequence identity may involve the use of 
oligonucleotide probes designed to selectively hybridize to target DNA, if the target 
contains DNA that is homologous (substantially identical) to all or part of rsaD or 
20 rsaE. Caulobacter of this invention comprise DNA encoding a transport protein that 
is homologous to either rsaD or rsaE. However, as disclosed herein, surface layer 
protein secretion signals useful in this invention may not exhibit such high identity to 
the secretion signal of the rsaA gene. The level of identity to RsaA or rsaA sequences 
might be less than 50%, which is lower than that required for "homology" as defined 
2 5 above. In such cases, the secretion signal may be solely defined according to is 
ability to effect transport of a protein of which the signal is the C-terminus, through 
the type I secretory system of a Caulobacter . The presence or absence of this function 
may be readily determined by monitoring extra-cellular occurrence of a protein of 
interest using known means or the procedures described herein. 

30 

Expression of heterologous polypeptides may be practised by use of modified 
S-layer genes borne on plasmids which may be readily constructed and introduced to 
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Caulobacter by electroportation. Typically, the plasmid is maintained in the 
Caulobacter by antibiotic selection. Highly modified S-layer genes with attached 
heterologous sequences may also be introduced into Caulobacter on a plasmid that is 
not replicated by Caulobacter since homologous recombination of the incoming 
5 modified S-layer gene with the chromosome-resident copy of the S-layer gene in the 
cell will often occur at a low but practicable frequency resulting in a gene rescue or 
transfer event. In some cases it may be desirable to obtain a stable cell line in which 
the chimeric S-layer gene is chromosomal. Various protocols for creating 
chromosomal insertions are set out in the Examples. 

10 

Use of Caulobacter S-layer protein as a vehicle for production of a 
heterologous polypeptide has several advantages. Firstly, the S-layer protein is 
synthesized in large quantities and has a generally repetitive sequence. This permits 
the development of systems for synthesis of a relatively large amount of heterologous 

15 material as a fusion product with an S-layer protein (chimeric protein). It may be 
desirable to retain the chimeric protein as pan of the bacterial cell envelope or, the 
fusion product may be separated from the organism, such as by the method described 
in: Walker, S.G., eral. (1992) J. Bacterid. 174:1783-1792. Alternatively, the 
Caulobacter strain that is used to express the fusion product may be derived from a 

2 o strain such as CB15Ca5 that sheds its S-layer. 

Caulobacter are particularly suited for use in bioreactor systems. An example 
would be the use of a modified Caulobacter to treat sewage, waste water etc. 
Caulobacters are ideal candidates for fixed-celi bioreactors, the construction of which 

2 5 is well-known (eg. rotating biological contactors). Other bacteria often produce 

copious polysaccharide slimes that quickly plug filtration systems. In some cases, 
other bacteria are not surface-adherent. By taking advantage of the natural bio-film 
forming characteristics of Caulobacter , bioreactors may be formed comprising a 
substrate and a single layer of cells adhered thereon, with the cells distributed at high 

3 0 density. A variety of substrates may be used such as a column of chemically 

derivatized glass beads or a porous ceramic material such as ceramic foam. 
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Another application is in the production of batch cultures of modified 
Caulobacter wherein the S-layer protein is a fusion product with an enzyme. For 
example, such Caulobacter could be grown in wood pulp suspensions at an 
appropriate juncture of the pulping process in order to provide for enzymatic 
5 decomposition of the wood-pulp structure. 

Examples of enzymes that may be expressed as chimeric S -layer proteins 
include alkaline phosphatase (eg. by expression of the pho A gene of E. coli ; 
see: Hoffman, C.S., and Wright, A. (1985) Proc. Natl. Acad. Sci. U.S.A. 
10 82:5107-5111; Bingle, W.H., eral. (1993) Can. J. MicrobioL39: 70-80; and, Bingle. 
W.H. and Srnit, J. (1994) Can. J. Microbiol. 40:777-782.) and, cellulase (eg. by 
expression of the CenA gene of Cellulomonas fimi ; see: Bingle, W.H. et al . (1993); 
and, Bingle, W.H. and Smit, J. (1994). 

15 Another application is the production of organisms that secrete and optionally 

present vaccine-candidate epitopes. Modified Caulobacter may be readily cultured in 
outdoor freshwater environments and would be particularly useful as fish vaccines. 
The two-dimensional crystalline array of the S-protein layer of Caulobacter , which 
has a geometrically regular, repetitive structure, provides an ideal means for dense 

20 packing and presentation of an epitope as part of an intact S-layer on the bacterial cell 
surface. 

Polypeptides secreted by Caulobacter may be harvested in large quantities, 
relatively free of contaminants and protein of host cell origin. Expression of a 

2 5 heterologous polypeptide fused with sufficient C-terminal amino acids of the S-layer 

protein to promote secretion of the protein results in the accumulation of large 
quantities of secreted product in the cell medium. The chimeric protein does not have 
to be released from the cell surface, but adjustment of the size of the S-layer protein 
portion can dictate whether the secreted chimeric protein is soluble or will precipitate 

3 0 in the cell medium. This is useful in cases where the Caulobacter is used to express a 

foreign antigenic component and it is desired to minimize the amount of host cell 
pro tern associated with the antigen. 
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Example 1: Production of Permissive Insertion Sites in C. crescentus 

Using the restriction enzyme Taq i, a partial digestion of the rsaA gene in 
5 pTZ18U:rsaA P produced a group of linearized segments with random Taq i sites 
cleaved. The linearized segments were modified by use of the tagged linker 
mutagenesis procedure of Bingle and Smit (1991), using the 12-bp BamHI linker 
carried in plasmid pUC102K discussed in the general procedure above. Those 
products that produced a full-length protein in E. coli were ultimately transferred to 
10 pWBI (a minor variation of pWB9 that is replicated by Caulobacter ), as described in 
the general procedure. The resulting construction was introduced into a C. crescentus 
strain. Distinguishable events were retrieved and analyzed for the ability to produce a 
full-length protein in C. crescentus and to produce the crystalline S-layer on their 
surface and the approximate location of the insertion. Cells were screened for the 
15 presence of a S-layer protein of approximately lOOkDa that is extracted from the 
surface of whole cells by 100 mM HEPES at ph2. The results of this screening 
resulted in five successful events. 

The five positive events represented cases where a 4-ainino acid insertion was 
2 0 tolerated with no effect on the S-layer function. The S-layers of the modified 
Caulobacter were indistinguishable from a wild-type S-layer. By producing 
3 versions of the gene of interest, representing each possible reading frame (using 
standard linker addition technology), one may test each of these sites for suitability in 
expressing the desired activity. Also, by using restriction enzymes other than Taq i 

2 5 (such as Aci L HinPI or Msp l) a larger library of BamH I insertions may be created. 

Example 2: investigation of Other Permissive Sites in rsaA Gene 

A library of 240 Bam HI linker insertions was created using the procedures of 

3 0 Example 1. Of the 240 insertions, 45 target sites in the rsaA gene were made with 

Taq i. 34 of the latter insertions were discarded because the clones contained deletions 
of rsaA DNA as well as the linker insertions. The remaining 11 resulted in 
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5 non-permissive and the 5 permissive sites found in Example 1 . The remaining 
195 insertions in the library were made using the enzymes HinPI, Acil, and Msp l to 
create target sites as outlined in Example 1. Of the latter 195 insertions, 
49 permissive sites were located for a total of 55. Of those sites scored as 
5 non-permissive, some may have had deletions of rsaA DNA at the linker insertion 
site. One BamHI linker insertion at a TaqI site thought to be permissive was later 
found by nucleotide sequencing to be located outside the rsaA structural gene reducing 
the total number of permissive sites to 54 from 55. The results show that sites that 
will accept 2-4 amino acids while still allowing the protein to be made and assembled 
1 0 into an S-layer are scattered up and down the protein. There is a high proportion of 
sites at which such insertions do not prevent expression and assembly of the S-layer. 
Approximately 25-50% of in-frame linker insertions will be tolerated by the S-layer 
protein and the Caulobacter and that diverse regions of the protein will tolerate 
insertions. 

15 

Example 3: Studies with Cadmium Binding Polypeptides 

Following the foregoing procedures, single and multiple copies of DNA 
encoding a synthetic cadmium binding peptide were synthesized, inserted at the amino 
2 0 acid 277 site of rsaA using the above described Carrier cassette, and expressed in 
C. crescentus . The peptide has a single cysteine residue. Mild acid extracts of whole 
cells expressing the modified gene were subjected to SDS-PAGE for identification of 
S-layer proteins. The S-layer protein was expressed and secreted when there was 
from 1 to 3 copies of the cadmium binding peptide present at RsaA amino acid 

2 5 position 277. Insertion of 4 or more copies resulted in a dramatic reduction of S-layer 

protein released from the whole cells by mild acid treatment to barely detectable 
levels. Detection by autoradiography of RsaA protein in vivo labelled with 35 S- 
cysteine and in vitro with 125 I- iodoacetamide confirmed that the cadmium binding 
peptide was pan of the chimeric protein. This demonstrates that C. crescentus is 

3 o capable of secretion of a chimeric rsaA protein having a limited cysteine content and a 

limited capacity for disulphide bond formation but that increased capacity of 
disulphide bond formation will limit production. 



WO 00/49163 



-21 - 



PCT/CAOO/00173 



Example 4: Expression and Presentation of Antigenic Epitopes on Caulobacter Cell 
Surface 

5 Using the library of the 49 permissive sites other than those made with TagI 

described in Example 2, the coding sequence for a 12-amino acid pilus peptide 
epitope lacking cysteine residues from Pseudomonas aeruginosa PAK pilus (described 
in Figure 8 of WO 97/34000) was inserted at the sites using the procedures described 
above, employing the carrier cassette described above. Positioning of the inserted 
10 DNA between the first Bam HI site and the Bgl n site permitted use of the latter site 
for making repeated insertions of DNA. DNA coding for the PAK pilus peptide was 
prepared by oligonucleotide synthesis of two anti-complementary strands. 

The transformed bacteria were screened for both production and presentation 
15 of the epitopes by the transformed Caulobacter using standard Western immunoblot 
analysis (see: Bumette, W. N. (1981) Analytical Biochemistry 112:195-203) and by 
colony immunoblot tests in which the cells were not disrupted (see: Engleberg, N.C., 
etjl. (1984) Infection and Immunity 44:222-227). Anti-pilus monoclonal antibody 
(PK99H) obtained from Dr. Irvin, Dept. of Microbiology, University of Albena, 
2 0 Canada was used in the immunoblot analyses to detect the presence of the pihis 
epitope insert. The antibody was prepared using purified Pseudomonas aeruginosa 
PAK pilus as the antigen and a monoclonal antibody was isolated by standard 
techniques using BALB/C mice as a source of ascites fluid. Reaction with the 
antibody in a whole cell colony immunoblot assay showed that the epitope is not only 

2 5 expressed in the transformed Caulobacter but is exposed on the S-layer surface 

overlying the cell in such a way that the epitope is available to the antibody. When 
two cysteine residues of the piiin epitope were incorporated in the chimeric protein, 
the protein was still expressed and secreted at normal levels. 

3 0 Of the organisms screened, insertions of the pilus epitope at the following sites 

in the rsaA gene as determined by nucleotide sequencing resulted in a positive 
reaction with the antibody in the whole cell Colony immunoblot analysis: 69, 277, 
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353, 450, 485, 467, 551, 574, 622, 690, 723, and 944. The results show that the 
permissive sites that will accept polypeptides of the size of the epitope are numerous 
and scattered across the gene. 

5 Further studies with the pilus peptide resulted in successful expression and 

secretion of chimeric proteins having single copies of the peptide at various other 
locations. Also, four and seven copies of the peptide were expressed and secreted as 
a RsaA chimeric protein when inserted at amino acids 277 and 551 respectively of the 
RsaA protein. However, insertions of the peptide at amino acids 69, 277, 450, 551 
1 0 and 622 resulted in a chimeric protein that did not attach to the cell surface and was 
released into the culture medium. 

Example 5: Insertion of Large Polypeptides 

f 

15 Bacterial surface proteins from organisms other than Caulobacter are generally 

not known to accept polypeptides larger than about 60 amino acids within the 
structure of the surface protein. The procedures of the preceding Example were 
carried out in order to insert the coding sequence of a 109 amino acid epitope from 
IHNV virus coat glycoprotein at the same insertion sites. The EHNV epitope was 
2 0 prepared by PGR and had a sequence as shown in Figure 9 of WO 97/34000, which is 
equivalent to amino acid residues 336^44 of the IHNV sequence described 
in: Koener, J.F. et al . (1987) Journal of Virology 61:1342-1349, Anti-IHNV 
polyclonal antibody against whole IHNV obtained from Dr. Joann Leong, Dept. of 

Microbiology, Oregon State University, U.S.A. (see: Xu, L. eyd. (1991) Journal of 
2 5 Virology 65:1611-1615), was used in immunoblot assays as described above to screen 

for Caulobacter that express and present the EHNV sequence on the cell surface. 

Reaction in the whole cell colony immunoblot assay was positive in respect of 

insertions at sites 450 and 551, and negative at a site which was at approximately 

amino acid 585. 

30 

The IHNV insert contains a single cysteine residue and is an extremely large 
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insert for successful expression as a fusion product with a bacterial surface protein. 
In further studies, the same 109 amino acid portion of the IHNV glycoprotein was 
inserted at amino acid 450 of the RsaA protein. The protein secreted by C. crescenrus 
was recovered from the cell culture medium. SDS-PAGE analysis showed that some 
5 of the protein was smaller than the predicted rsaA chimeric protein but still bound the 
anti-EHNV antibody. Analysis of these proteolytic products showed that cleavage of 
the chimeric protein occurred at an Arg residue encoded by the gene transfer cassette. 
Thus in some cases, adjustment of the nucleotide sequence at the interface of the 
polypeptide and rsaA coding sequences may be necessary to prevent expression of an 
10 arginine residue. 



Example 6: 



Methods are described above for the insertion of 12-bp BamHI linker sites into 
15 a promoterless version of the rsaA gene. Because linker insertions involve the 
insertion of 12 bp (a multiple of three), an in-frame linker insertion resulted in every 
case. These linker sites are introduced to allow subsequent insertion of DNA 
encoding foreign peptide/proteins. Expression of such chimeric genes leads to the 
production of an entire full-length RsaA protein carrying the inserted heterologous 
2 0 amino acid sequence of interest. A number of BamHI site positions were identified 
above precisely by nucleotide sequencing. Four of the sites in the rsaA gene 
correspond to amino acid positions 188, 782, 905, 944 in the RsaA protein. For this 
example, an additional linker insertion was created at amino acid position 95 of the 
native gene (i.e. this gene carried its own promoter) using the same methodology. All 

2 5 five in-frame BamHI linker insertion sites were inserted in the rsaA so that the 

nucleotides of the linker DNA were read in the reading frame GGA/TCC. 

Because all BamHI linker nucleotides were read in the same reading frame, 
the 5' region of one rsaA gene carrying a BamHI linker insertion at one position 

3 0 could be combined with the 3' region of an rsaA gene carrying another of the Bam HI 

linker insertions to create in-frame deletions with a BamHI site at the joint between 
adjacent regions of rsaA. Using such a method, in-frame deletions of rsaA 
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( AA95-782) and rsaA( AA188-782) were created. 

DNA fragments encoding various C-terminal portions of the 1026 amino acid 
RsaA protein were isolated using the newly inserted BamHI linker sites as the 
5 5' terminus of the fragment and a HindH I site as the 3' terminus of the fragment. 
These BamHI fragments were transferred to the BarnHI/Hiixffll sites of pUC8 
(J. Vieira, and J. Messing. (1982) Gene 19:259-268) creating rsaA C-terminal 
segment carrier plasmids (see Figure 12 of WO 97/34000). The insertion into pUC8 
also resulted in the creation of an in-frame fusion between the first 10 N-terminal 
10 amino acids of LacZa and the various C-terminal fragments (AA782-1026, 
AA905-1026 or AA944-1026) of RsaA. These LacZarrsaA fusion proteins can be 
produced in Caulobacter using the lacZa transcription/translation initiation signals 
when introduced on appropriate plasmid vectors or direct insertion into the 
chromosome (see: W.H. Bingle, etal. (1993) Can. J. Micro f biol. 39:70-80). 

15 

Both types of construction, the deletion versions and the C-terminal only- 
segments, resulted in the production of proteins secreted by the Caulobacter as highly 
modified S-layer proteins. The gene segments can also facilitate the secretion of 
heterologous polypeptides by insertion or fusion of appropriate DNA sequences at the 
2 0 unique BamHI site that exists in each of the constructions, as described below. 

A- Creating Fu sions of Desired Sequences with C-terminal Portions of a Caulobacter 
S-layer Gene -Method 1 

2 5 The process may be as follows: 

(1) Inserting the desired sequence into the Carrier cassette. Heterologous sequences 
may be introduced into a carrier by: 

3 0 (a) insertion of a single copy of the desired gene segment. 

Depending upon the length of a gene segment, two methods of construction 
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may be used. For segments of up to about 30 amino acids, two oligonucleotides of 
appropriate sequence may be chemically synthesized, annealed by mixing, heating and 
slow cooling and ligated into the carrier cassette. The oligonucleotides will also 
contain additional base pairs that recreate "sticky ends" of appropriate restriction 
5 endonuclease sites at each end of the duplex DNA that results from the annealing 
process. 

For longer segments, PCR may be used to amplify a region of a target DNA 
sequence. Oligonucleotides are synthesized that have sequence complementary to the 
10 boundaries of the desired sequence and which contain additional base pairs that 
recreate a "sticky end" of an appropriate restriction endonuclease site. In the present 
example oligonucleotides are made to produce products with the appropriate 
restriction endonuclease site for directional cloning into the carrier cassette. PCR 
amplification of the desired sequence is then done by standard methods. 

15 

For each method, sticky ends must be appropriate for restriction sites at the 
5' terminus and the 3' terminus. This places the desired gene segment in the correct 
orientation within the carrier cassette, Reading frame continuity is maintained by 
appropriate design of the oligonucleotides used for the PCR step. 

20 

(b) Preparation of multiple copies of the desired gene segment. 

The carrier cassette also allows for production of multiple insert copies. For 
example, a restriction site in the cassette may be restored after removal of a 

2 5 promoterless antibiotic resistance gene and the site is then used to insert an additional 

copy as described in WO 97/34000. This "piggy-back" insertion still maintains the 
correct reading frame throughout the construction. Any number of additional cycles 
of "piggy -backing" can be done because the ligation results in a sequence which is no 
longer a substrate for the restriction enzymes. The result is the production of 

3 0 cassettes of multiple copies of the desired sequence which can be transferred to 

appropriately modified S-layer protein genes with the same ease as a single copy. An 
additional feature of this method is that different heterologous sequences can be paired 
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together in this multiple copy cassette with the same ease as multiple copies of the 
same heterologous sequence. 

Example 6a: Insertion of an 109 amino acid segment of the IHNV surface 
5 glycoprotein to Carrier cassette, 

A PCR product was made that contained the DNA coding for amino acids 336 
to 444 of the major surface glycoprotein of the Infectious Hematopoietic Necrosis 
Virus (IHNV), as described in WO 97/34000. 

LO 

Example 6b: Insertion of an 184 amino acid segment of the IHNV surface 
glycoprotein to Carrier cassette. 

A PCR product was made that contained the DNA coding for amino acids 270 
15 to 453 of the IHNV glycoprotein segment. 

Example 6c: Insertion of single and multiple copies and an epitope of the 
Pseudomonas aeruginosa PAK pilus gene to Carrier cassette. 

2 0 Oligonucleotides were constructed to code for the pilus epitope described in 

Example 4. Using the methods outlined in pan A(l)(b) of this Example, 3 tandem 
copies were prepared. 

(2) Transfer of Carrier Cassette to C-terminal Segment Carrier Plasmids. The 

2 5 constructs described in Examples 6a and 6b were transferred to a rsaA C-terminal 

Segment carrier plasmid, as described above, resulting in an in-frame fusion of: (a) a 
10 amino acid section of the p-galactosidase protein, (b) the desired sequence flanked 
by 2-3 amino acids derived from the carrier cassette sequence, and (c) the appropriate 
rsaA C-terminal segment. In some cases, the first codon of the rsaA C-terminal 

3 0 segment is converted to a different codon as a result of the fusion. For example. 

while the rsaA C-terminal segment may have coded for amino acids 944-1026 of 
RsaA, the resulting chimeric protein may only have amino acids 945-1026 native to 
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RsaA. 

Example 6d: Fusion of carrier/ 109 AA and 184 IHNV segments to C-terminal rsaA 
segment AA782-1026. 

5 

This was done using the carrier cassettes described in Examples 6a and 6b and 
the AA782-1026 rsaA C-terminal segment carrier plasmid described above. 

Example 6e: Fusion of Carrier/ 109 AA and 184 AA IHNV segments to C-terminal 
10 rsaA segment AA9O5-1026. 

This was done using the carrier cassettes described in Examples 6a and 6b and 
the AA905-1026 rsaA C-terminal segment carrier plasmid described above. 

15 Example 6f: Fusion of Carrier/ 109 AA and 184 AA EHNV segments to C-terminal 
rsaA segment AA944-1026. 

This was done using the carrier cassettes described in Examples 6a and 6b and 
the AA944-1026 rsaA C-terminal segment carrier plasmid described above. 

20 

Example 6g: Fusion of Carrier/3x Pilus Epitope Segment to C-terminal rsaA 
Segment AA782-1026. 

This was done using the carrier cassettes described in Example 6c and the 
2 5 AA782-1026 rsaA C-terminal segment carrier plasmid described above. 

(3) Expression of the Desired Fusion in an Appropriate Caulobacter Host Strain . 

(a) Plasmid-based expression. 

30 

To create plasmid vectors that can be introduced and maintained in 
Caulobacter , an entire C-terminal segment carrier plasmid may be fused to a broad 
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host range vector such as pKT215 or pKT210 (see: M. Bagdasarian, et ai. (1981) 
Gene 16:237-247) using the unique HindlH restriction site present in each plasmid. 
The resulting plasmid is introduced into Caulobacter by conjugation or electroporation 
methods and is maintained by appropriate antibiotic selection. 

s 

The fusions described in Examples 6d-6g were expressed in C. crescentus . In 
each case expression and secretion of the chimeric S -layer protein was detected by 
Western immunoblot analysis of electrophoretic gels of the cell culture supermutant 
employing the monoclonal antibody for each of the polypeptide epitopes. The 
10 transporter signal is localized to amino acids 945-1026 of the S-layer protein since all 
the chimeric proteins in the Examples were secreted. Precipitation of the chimeric 
protein occurred with the use of rsaA segment AA782-1026 but not AA944-1026. 
Recovery of precipitate using AA905-1026 was reduced as compared to AA782-1026. 

15 (b) Selection of appropriate C. crescentus host strains. 

It is often desirable to use a S-layer negative host strain such as CB2A or 
CBlSaKSac, If it is important to ensure that the fusion protein is not attached to the 
cell surface, the use C. crescentus strains CB15Ca5KSac or CBlSCalOKSac may be 
2 0 appropriate. The latter strains have additional mutations that result in the loss of the 
production of a specific species of surface lipopolysaccharide that has been 
demonstrated to be involved with the surface attachment of native S-layer protein as a 
2-dimensionaI crystalline array (see: Walker S.G. etal. (1994) J. Bacterid. 
176:6312-6323). With highly modified versions of an S-layer gene, this provision is 

2 5 not necessary since virtually all regions of the gene that may have a role in the 

attachment process will be absent. 

An example of a growth media well suited to both propagation of Caulobacter 
for general purposes (including cloning steps) and also to produce the secreted and 

3 0 aggregated chimeric proteins is PYE medium, a peptone and yeast extract based 

medium described in Walker et al. t (1994). 
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B- Creating Fusions of Desired Sequences with C-terminal Portions -Method 2 

Methods other than the use of a carrier cassette plasmid are possible for 
creation of heterologous insertions into deletion versions of a S-layer gene or fusions 
5 with C-terminal portions of a S-layer protein. PCR may be used or other known 
methods may be used. The general procedure is as follows: 

(1) Use of PCR to prepare appropriate segments : 

(a) Preparation of amplified segment with appropriate ends may be carried 
out In a manner similar to that described part A(l)(a) of this example. 
Oligonucleotides are designed and synthesized such that they will anneal to 
appropriate regions of the desired heterologous DNA and also contain "sticky ends" 
of appropriate sequence and frame so that the resulting PCR product can be directly 
inserted into appropriate modified S-layer genes. 

(b) Transfer to appropriate C-terminal segments may be carried out by 
inserting the PCR products into selected C-terminal segments such as AA782-1026, 
AA905-1026, or AA944-1026, as described in Examples 6d-6g. In addition to the 
Bam HI site described, the EcoRl restriction site could also be used as the 5* terminus 
of the incoming PCR segment, since this site is also available in the pUC8 vector and 
not in the S-layer gene, so long as the correct reading frame was maintained when 
designing the oligonucleotides used to prepare the PCR product. 

2 5 (2) Expression of the desired fusion in an appropriate Caulobacter host strain may 
be carried out using the procedures outlined in part A(3) of this example. 

C- Creating Insertions of Desired Sequences into Versions of a S-layer Gene 
Having Large Internal In-frame Deletions, 

30 

The general process may be as follows* with reference to rsaA: 
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(1) Creating Appropriate I rv frame Deletions 

rsaA ( AA95-782) and rsaA( AA 188-782) may be prepared as described 
above. Because most of the BamH l linker insertion sites are in the same reading 
5 frame with respect to each other, it is possible to combine other pairs of 5' and 3' 
segments using the same general method, with the same result of maintenance of 
correct reading frame throughout. These deletion versions may then be tested 
individually to ensure that S-layer protein is still secreted by the Caulobacter . 

1 0 (2) Insertion of a Gene Segment carrier Cassette Containing the Desired 
Sequences : insertion and transfer of carrier cassettes may be done using the 
procedures described in parts A(l) and A(2) of this example. 

Example 6h: Insertion of the 109 AA IHNV segment into rsaA ( AA95-782) and 
15 insertion of the 109 AA IHNV segment into rsaA( AA 188-782) may be carried out as 
in Examples 7d~7g. Expression of the desired genetic construction in appropriate 
C. crescentus strains may be done using the procedures outlined in part A(3) of this 
example. 

2 0 (3) Alternate PCR Procedures ; may be used to prepare a heterologous segment 
for direct insertion into the BamHI site with the deletion versions of the rsaA gene. 
The procedure is essentially the same as described in part B(l) of this example. 

Example 7: Transfer to a Native S-layer Gene Chromosomal Site as a Single 

2 5 Crossover Event 

Fusion of a carrier cassette containing heterologous DNA segments to a 
C-terminal S-layer protein segment plasmid results in a plasmid that is not maintained 
in Caulobacter . Selection for the antibiotic marker on the plasmid results in detection 

3 0 of the rescue events. Most commonly these are single crossover homologous 

recombination events. The result is a direct insertion of the entire plasmid into the 
chromosome. Thus the resident copy of the S-layer gene remains unchanged as well 
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as the bcoming modified S-!ayer gene. In such cases it may be desirable to use 
Caulobacter strains in which the resident S-!ayer gene has been inactivated by 
adapting known procedures. One example is C crescentus strain CBlSAKSac which 
has an antibiotic resistance gene cassette introduced at a position in the S-layer gene 
5 about 25% of the way from the 5' terminus. 

Example 8: Transfer to a Native S-Iayer Gene Chromosomal Site as a Double 
Crossover Event 

10 In certain cases it may be desirable to completely exchange a resident S-layer 

gene with an incoming modified version. One method is by the incorporation of a 
sacB gene cassette (Hynes, M.F., et al. (1989) Gene 78: 111-119) into pUC8 based 
piasmids carrying the desired chimeric gene construction. This cassette contains a 
levansucrase gene from Bacillus subtilis that, in the presence of sucrose, is thought to 
1 5 produce a sugar polymer that is toxic to most bacteria. One first selects for a single 
crossover event as described in Example 7. Subsequent growth on sucrose-containing 
medium results in the death of all cells except those that lose the offending sac B gene 
by homologous recombination within adjacent gene copies. Two events are possible; 
restoration of the resident copy of the S-layer gene or replacement of the resident 
2 0 copy with the incoming modified gene. A screen with insertion DNA as probe or 
antibody specific to the heterologous gene product identifies successful gene 
replacement events. The method requires that S-layer gene sequence or native 
sequences immediately adjacent to an S-layer gene be present on both sides of the 
heterologous sequence and is best suited for deletion versions of a S-layer gene. 

25 

Other methods are available for the delivery of genes to the chromosome of 
Caulobacter . Methods involving the use of the transposons Tn5 and Tn7 as a means 
of delivery of genes to random chromosome locations are available (see: Barry, 
G.F. (1988) Gene 71:75-84.). The use of the xylose utilization operon as a target for 
3 0 chromosome insertion have also been described. This method involves the 
incorporation of a portion the operon into a pUC8 based plasmid construction. This 
allows homologous recombination within the xylose operon as a means of plasmid 
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rescue. Loss of the ability to use xylose as a nutrition source .confirms .the. rescue 
event. 

Example 9: Transformation and Expression of Heterologous Protein in Caulobacter 
5 other than C, crescentus 

Using the procedures described above, a DNA construct made according to 
Examples 4 and 6 was introduced into the freshwater S-layer producing Caulobacter 
identified as FWC42 in MacRae, J.D, and J, Sink (1991) and in Walker, S.G. et al . 
10 (1992). FWC42 is clearly distinct as a species separate from C. crescentus . The 
construct contained 3 copies of the pilus epitope and a nucleotide sequence encoding 
amino acids 690-1026 of RsaA as the secretion signal. The heterologous polypeptide 
was expressed by the transformed FWC42 cells and was secreted at sufficient levels 
such that the secreted protein was found in the cell medium as an aggregate. 

15 

Example 10: Demonstration of Type I Secretion Mechanism and 
Sequence Similarity in Different Caulobacter Species 

The following non-C. crescentus species of freshwater Caulobacter as 
2 0 described in MacRae, J.D. and J. Smit (1991) and in Walker, S.G. et al (1992) were 
employed in this Example: FWC1, FWC8, FWC9, FWC17 and FWC19, FWC28, 
FWC32, FWC39 and FWC42. 

Employing the materials and methods described in Awram, P. and J. Smit 

2 5 (1998) J. of Bacteriology 180:3062-3069, species FWC8, 9, 17, 19, 28, 32, 39 and 

42 were transfected with plasmids containing the P. aeruginosa alkaline protease gene 
(aprA) which is a known type I secretory protein. The protease was shown to be 
secreted at levels comparable to the levels of such protease reported by Awran and 
Smit for C crescentus transformed in the same way. Thus, the transport mechanism 

3 0 in the non-C. crescentus species are Type I mechanisms capable of recognizing 

diverse Type i (C-terminal) secretion signals. 
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The following recombinant DNA and DNA sequencing methods are described 
in Awram, P. and J. Smit (1998) and may be used with appropriate adaptation in this 
invention. These procedures may be used in screening suitable Caulobacter for use as 
host organisms and for identification of Caulobacter of this invention. E. coli DH5 a 
5 (Life Technologies) was used for all E. coli cloning manipulations. E, coli was 
grown at 37<>C in Luria broth (1% tryptone, 0.5% NaCl, 0.5% yeast extract) with 
1.2% agar for plates. Caulobacter was grown at 30<>C in PYE medium 
(0.2% peptone, 0.1% yeast extract, 0.01% CaCl 2t 0.02% MgS0 4 ) with 1.2% agar 
for plates. Ampicillin was used at 100 ^g/ml, streptomycin was used at 50 ^g/ml, 
10 kanamycin was used at 50 ^g/ml, and tetracycline was used at 0,5 ^g/ml for 
Caulobacter and at 10 ^g/ml for E. coli when appropriate. 

Standard methods of DNA manipulation and isolation were used. 
Electroporation of Caulobacter was performed as described above. Southern blot 
15 hybridizations were done in accordance with the membrane manufacturer's manual 
(Amersham Hybond-N). Radiolabeled probes were made by nick translation using 
standard procedures. 

PCR product containing rsaD and rsaE was generated using primers 
2C 5 ' -CGG AATCGCGCT ACGCGCTGG-3 ' (SEQ ID NO:2) and 
5 1 -GGG AGCTCG AAGGGTCCTG A-3 1 (SEQ ID NO:3). Product was generated 
using Taq polymerase (Bethesda Research Laboratories) and following the 
manufacture's suggested protocols. Following a 5-min denaturation at 95 0 C, two 
cycles of 1 min at 42°C, 2 min at 65°C 1 and 30s at 95 °C were followed by 25 cycles 
2 5 of 1 min at 55 °C, 2 min at 65 °C, and 30 s at 95 °C. The vector pBSKS+ Stratagene 
was cut at the EcoKW site and T tailed. The PCR product was ligated into this vector 
so that rsaD and rsa E would be in the same orientation as the lacZ promoter of 
pBSKS + . This construct was called pRAT5 . 

30 

Plasmid pBBR5 was constructed from plasmids pBBRlMCS (Kovach, M.E. 
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et al . (1994) BioTechniques 16:800-802) and pHP45Q-Tc (Feilay, R. et_al. (1987) 
Gene 52:147-154). The q-Tc fragment from pHP45Q-Tc was removed by using 
HindUl, and the ends were blunted by using T4 polymerase. A 0.3-kbp portion of the 
Cmr-encoding gene was removed from pBBRlMCS b7 cutting with Oral and replaced 
5 with the blunted q-Tc Fragment, producing a Tcr broad-host-range vector that 
replicates in Caulobacter . 

Plasmid pRAT4^H was made by removing the Clal-HindUl fragment from 
pTZ18UB:rsa A P (Bingle, W.R etal. (1997) J, Bacterid, 179:601-611) and replacing 
10 it with the Clal-HindUl fragment from pRATl containing the C-terminus of rsaA and 
the complete rsaD and rsaE genes. 

A NA1000 cosmid library (Alley, M.R. etj. (1991) Genetics 129:333-341) 
was probed with radiolabeled rsaA. 11 cosmid clones hybridizing to the probe were 

15 isolated. Southern blot analysis was used to determine which cosmids contained DNA 
3' of rsaA* An 11.7kb S&l-EcoRl fragment containing rsaA plus 7.3 kb of 3 1 DNA 
was isolated from one of the cosmids and cloned into the &rl-EcoRl site of 
pBSKS + ; the resulting plasmid was named pRATl. The 3 1 end of the cloned 
fragment consisted of 15 bp of pLAFRS DNA containing Sau3 Al, Smal; and EcoRI 

20 sites. 

BamKL fragments from pRATl were subcloned into the BarnVH site of vector 
pTZ18R for sequencing. The 3' -end fragment was subcloned into pTZ18R by using 
BamYSl and EcoRI. The 5* -end fragment was subcloned into pTZ18R by using 

2 5 HindUl. Sequencing was performed on a DNA sequencer (Applied Biosystems™ 

model 373). After use of universal primers, additional sequence was obtained by 
"walking along" the DNA using 15-bp primers based on the acquired sequence. 
Nucleotide and amino acid sequence data were analyzed by using Geneworks™ and 
MacVectonw software (Oxford Molecular Group) or the National Center for 

3 0 Biotechnology Information BLAST e-mail server using the BLAST algorithm. 

Protein alignments were generated by using the ClustalW™ algorithm as implemented 
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by the MacVector™ software and using the default settings. 

Sequences for rsaD and rsaE have been assigned GenBank Accession 
No. AF062345. The proteins have the following sequences: 

5 

RsaD (SEQ ID NO:4): 

MFKRSGAKPTIFI>2AVLVARPA 

FLVYGIJ-EAIJiTCr^VRGGI^FEXSVARD P IFKSVIJDSTLSRKGIGGQAFHDMDQVREF^ 

KWGGI^AKWRARi^EQVAWQAA^^ SAGAMXAGS ILV 

GTRAIAPIEGAVG^WKbTf IGARG^ 

S FRID AGAAVALVGPS AAGKS SIJ^GIVGVWPCAAGVTRIJ^GYD XKQWD PEKLGKHVGYLPQD IELFSGT 
VAQNXXRFTE FE SQEVIEAATIAGVHEmOSI-PhiGYDTAJCCT 
1 5 D£P1^IX>QVGEVAI>EAMKRIJ^^ 

RsaE (SEQ ID NO:5): 

MKPPKIQRPTONFQAVARIGY GI IALTFVGIJjGWAAFAP ID 5AVXAKGWSAEVSQDVQHLE GGMLAKIL 
2 0 VREG£KVKAGQVI^IJ}PTQANAAA^ 

IADEQAQFTERRQT IQGCfVDIMMAQKLQYQSE XEGIDRQTQGIJKDQI/SFIEDELIDIJ^KLYDKGLVPRPR 
IJLAIiEAKAGSLSGSIGRLTADRSKAVQGASDTQUCW 

AQKRI KXVS FVHGT AQNXJIFFTEGAVVRAAE PLVD IAPEDEAFVT Q AHFQPTDVD^^/HM(>^VTEVRI*P AF 
K SAGNPD PERKD PVAVADR I SDPQKQAJU^FLXSIVKVDVKQI^PHI^^ 

2 5 FSPLPBTLRTTMREE 

The Table below sets out results of initial sequencing of S-layer related genes 
in 2 strains of C. crescentus (NA1000 and CB2) and four non-C. crescentus species of 
S-layer producing freshwater Caulobacter (FWC6, 8, 19, 27 and 39). Genes 

3 z identified as A, D and E are the S-layer structural gene, the ABC transporter gene and 

the membrane fusion protein (MPF) respectively. In the C. crescentus strains, the 
latter genes are rsaA, rsaD, and rsaE, respectively. The transport protein genes are 
highly conserved within and among the species. Within the C. crescentus species, 
there is high conservation of the rsaA gene, including the C-terminal secretion signal. 
3 5 A region of the S-layer gene in FWC27 outside the secretion signal region shows 
clear divergence from the equivalent region in the two C. crescentus strains. 
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Further sequencing studies comparing the S-Iayer protein gene (A), the 
ABC-transporter gene (D) and the MFP gene (E) of FWC1, 9, 19, 39 and 42 to the 
rsaA, rsaD and rsaE genes of C. crescentus strain CB15A produced the following 
5 results. FWC1 and FWC19 respectively exhibited about 32% and about 27% identity 
over the last 300 C -terminal amino acids of the A gene, as compared to rsaA. For the 
last 100 C-terminal amino acids, identity of FWC1 and FWC19 sequences to rsaA 
was about 50% and 41 % (respectively) t with the most significant identity being in the 
last 62 amino acids. Sequencing of various 35 to about 350 amino acid segments of 
10 the D gene from FWC1, FWC9, FWC19, FWC39 and FWC42 resulted in sequence 
identity to rsaD of at least about 79%. Sequencing of large portions of the E gene 
from FWC19 and FWC42 (about 368 and about 290 amino acids respectively) 
demonstrated about 85% and about 73% identity (respectively) to the rsaE gene. 

* 

15 Approximately the last 100 C-terminal amino acids of the A gene for FWC1 

(SEQ ID NO:6) and FWC19 (SEQ ID NO:7) are set out below. 

FWC1 

20 TTDTIJCF ANTG— TETFTSTKVDLTGVNDFTAALN AAAAGNGGGNGII 
TWFQYGGNTYI\'EDRDAGNTFN^ATDIV\TCLTGA\T)LST-AVLSAFGRRS 
SLTLV 

FWC19 

2 5 

RAHMIIJCPTRHVSDRWGRHVARLVQLPGRPCPKI^DAATTGNASHKV 
SWFVYGGDTYLVKN1STLAPPSKTARTTVVKLTGTTNDLTK— ATFDGAAH 
TLTLG 



30 



This invention now being described, it will be apparent to one of ordinary skill 
in the art that changes and modifications can be made thereto without departing from 
the spirit or scope of the appended claims. 
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WE CLAIM: 

1. A host cell for expression and secretion of a heterologous polypeptide, 
wherein the cell is a Caulobacter comprising at least one surface layer transport 

5 protein having an amino acid sequence homologous to SEQ ED NO:4 or SEQ ID 
NO:5, and wherein the host further comprises a DNA construct comprising DNA 
encoding a polypeptide heterologous to a surface layer protein of the cell 5' from and 
operably linked to DNA encoding a Caulobacter surface layer protein secretion signal, 
with the proviso that when the cell comprises transport proteins having the same 

10 sequence as both SEQ ID NO:4 and SEQ ID NO:5, the secretion signal is not from 
C. crescentus . 

% 2. The cell of claim 1 wherein at least one of the transport proteins of the cell has 

€1 an amino acid sequence the same as SEQ ID NO:4 or SEQ ID NO:5. 

UJ 15 

'f: 3. The cell of claim 2 having transport proteins with the same amino acid 

s f sequence as SEQ ID NO:4 and SEQ ID NO:5, and wherein the secretion signal does 

U. not comprise SEQ ID NO: L 

5) J20 4. The cell of claim 1 , 5^enSlwherein the DNA construct further comprises an 
C 7 operably linked promoter recognized by the cell. 

5. A method for identifying a Caulobacter suitable for use as a host ceil for 
expression and secretion of a heterologous polypeptide comprising: 

25 

(a) extracting DNA from a candidate Caulobacter ; 

(b) contacting the DNA with an oligonucleotide capable of selective 
hybridization to a nucleotide sequence encoding SEQ ID NO:4 or SEQ ID NO:5; and 

30 

(c) determining whether the oligonucleotide hybridizes to the DNA. 



WO 00/49163 



-39- 



PCT/CA00/00173 



6. The method of claim 5 wherein the oligonucleotide is labelled and said 
determining is by detection of the presence of the label bound to the DNA. 

7. The method of claim 5 wherein said determining is by amplification of DNA 
5 with the oligonucleotide as a primer, followed by detection of an amplification 

product. 

8. A DNA construct comprising one or more restriction sites for facilitating 
insertion of DNA into the construct, wherein the construct further comprises DNA 

10 encoding a Cauiobacter surface layer protein secretion signal not present in 
C. crescentus. 

9. A DNA construct comprising DNA encoding a polypeptide not present in 
Cauiobacter surface layer protein 5' from and operatively linked to DNA encoding a 

15 Cauiobacter surface layer protein secretion signal not present in C. crescentus . 

10. The DNA construct of claim 9 on±(fc&irther comprising an operably linked 
promoter recognized by Cauiobacter , 

2 0 11. The DNA construct of claim 8, jfcorrtfi^. wherein the secretion signal has an 
amino acid sequence which does not comprise SEQ ED NO:l. 

12. A bacterial cell comprising a DNA construct of claim 9,4*(kiQ.CrB*. 

2 5 13. The cell of claim 12, wherein the cell is a Cauiobacter . 

14. The cell of claim 12, wherein the cell is C. crescentus . 

15. The cell of claim 13 or 14, wherein the DNA construct further comprises an 
30 operably linked promoter recognized by Cauiobacter and wherein the DNA construct 

is expressed in the cell and the protein so expressed is secreted by the cell. 
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16. A secreted protein obtained from a cell of claim 15, wherein the protein 
comprises one or more portions heterologous to a surface layer protein of the cell and 
wherein the protein has a C-terminal portion comprising a surface layer protein 
secretion signal not present in C. crescentus . 
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